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Abstract: We show that for any sequence /(l),/(2),... taking values in { —1,+1}, the 
discrepancy 

n 

sup Y^fUd) 

j=\ 

of / is infinite. This answers a question of Erdos. In fact the argument also applies to 
sequences / taking values in the unit sphere of a real or complex Hilhert space. 

The argument uses three ingredients. The first is a Fourier-analytic reduction, obtained 
as part of the Polymaths project on this problem, which reduces the problem to the case 
when / is replaced by a (stochastic) completely multiplicative function g. The second is 
a logarithmically averaged version of the Elliott conjecture, established recently by the 
author, which effectively reduces to the case when g usually pretends to be a modulated 
Dirichlet character. The final ingredient is (an extension of) a further argument obtained by 
the Polymaths project which shows unbounded discrepancy in this case. 
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1 Introduction 


Given a sequence /: N —)■ // taking values in a real or complex Hilbert space H, define the discrepancy 
of / to be the quantity 


sup 


H 


Y 

;=i 

In other words, the discrepancy is the largest magnitude of a sum of / along homogeneous arithmetic 
progressions {d, 2d, ...,nd} in the natural numbers N = {1,2,3,...}. 

The main objective of this paper is to establish the following result: 
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Theorem 1.1 (Erdos discrepancy problem, vector-valued case). Let H be a real or complex Hilbert space, 
and let f'. ^ ^ H be a function such that ||/(n) ||// = 1 for all n. Then the discrepancy of f is infinite. 

Specialising to the case when H is the reals, we thus have 

Corollary 1.2 (Erdos discrepancy problem, original formulation). Every sequence /(l),/(2),... taking 
values in {—1, -|-1} has infinite discrepancy. 

This answers a question of Erdos [9] (see also Cudakov [7] for related questions), which was recently 
the subject of the Polymaths project [19]; see the recent report [10] on the latter project for further 
discussion. 

It is instructive to consider some near-counterexamples to these results - that is to say, functions 
that are of unit magnitude, or nearly so, which have surprisingly small discrepancy - to isolate the key 
difficulty of the problem. 

Example 1.3 (Dirichlet character). Let X - N —)• C be a non-principal Dirichlet character of period q. 
Then % is completely multiplicative (thus ;^(nm) = x{^)x{^) for ^ny n,m G N) and has mean zero on 
any interval of length q. Thus for any homogeneous arithmetic progression {d,2d,... ,nd} one has 


L XUd) 




Xid)f^xU) 


,7=1 


<q 


and so the discrepancy of x is at most q (indeed one can refine this bound further using character sum 
bounds such as the Burgess bound [3]). Of course, this does not contradict Theorem 1.1 or Corollary 
1.2, even when the character x is real, because x{n) vanishes when n shares a common factor with q. 
Nevertheless, this demonstrates the need to exploit the hypothesis that / has magnitude 1 for all n (as 
opposed to merely for most n). 


Example 1.4 (Borwein-Choi-Coons example). [2] Let x^ be the non-principal Dirichlet character of 
period 3 (thus X^i^) equals -|-1 when n = 1 (3), —1 when n = 2 (3), and 0 when n = 0 (3)), and define 
the completely multiplicative function X 3 - N—>^{ —l,-|-l}by setting Xsip) •= Xsip) when p 3 and 
^ 3 ( 3 ) = -|-1. This is about the simplest modification one can make to Example 1.3 to eliminate the zeroes. 
Now consider the sum 

i^Hj) 

i=i 

-|- 3^ for some large k. Writing j = 3'm with m coprime to 3 and i at most k, we 

£ £ Z3(3'm). 

i—0 \ <m<n/3j 

Now observe that X3{3'm) = X3{^)'X3{^) = X3{^)- The function X 3 has mean zero on every interval of 
length three, and \ n/3^\ is equal to 1 mod 3, hence 

£ X3{3'm) = 1 

\<m<n/3}'.(m,3) — \ 


with n := 1 -|-3 -|-3^ H- 

can write this sum as 
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for every i = 0,... ,k. Summing in i, we conclude that 

n 

=^+l >log«. 

;=i 

More generally, for natural numbers n, L/=i X^U) is equal to the number of Is in the base 3 expansion of 
n. Thus X 3 has infinite discrepancy, but the divergence is only logarithmic in the n parameter; indeed from 
the above calculations and the complete multiplicativity of Xs we see that sup„<yv.^gji^ | ^ 7=1 X 3 U^)\ is 
comparable to log N ioT N > 1. This can be compared with random sequences / : N—)•{ —1,+1}, whose 
discrepancy would be expected to diverge like See the paper of Borwein, Choi, and Coons 

[ 2 ] for further analysis of functions such as ^ 3 , which seems to have been first discussed in [ 20 ]; see 
also [4] for some further discussion of the sign patterns in X 3 - One can also reduce the discrepancy of 
this example slightly (by a factor of about two) by changing the value of the completely multiplicative 
function X3 at 3 from +1 to — 1. 

If one lets be a primitive Dirichlet character whose period is a prime p = I (4), and defines X 
similarly to ^3 above, then the partial sums T!j=i XU) remain unbounded, but the Cesaro sum (1 — 
i)xU) remains bounded! This observation^ can be derived from aroutine application of Perron’s formula, 
together with the functional equation for L{s,x)^ which in the p = 1 (4) case establishes a zero of L{s,x) 
at 5 = 0. Thus it is possible for the “Cesaro smoothed discrepancy” of a { —l,+l}-valued sequence to be 
finite. 


Example 1.5 (Vector-valued Borwein-Choi-Coons example). Let Hhe a real Hilbert space with orthonor¬ 
mal basis 60 , 61 , 62 , ■■■, let X 3 t>e the character from Example 1.4, and let /: N —)• // be the function 
defined by setting f{3°m) := X 3 {^)^a whenever a = 0,1,2... and m is coprime to 3. Thus / takes values 

in the unit sphere of H. Repeating the calculation in Example 1.4, we see that ifn= 1 -|- 3 -|- 3^ -|-[-3^, 

then 

n 

Y^fU)=eo^ - V 6 k 

j=i 


and hence 


Conversely, if n is a natural 
Pythagoras’s theorem that 


L fU) 

./=i 


= \/k+l > Y^log? 


H 

number and d = 3^d' for some ( = 0,1,... and d' coprime to 3, we see from 


n 



£ fijd) 

= 

L ei+i Y. X3{md') 


H 

/>0:3'</i m<n/y 





< 

E ') 






\/logn. 


Thus the discrepancy of this function is infinite and diverges like -^/logiV. 
'Bill Duke, personal communication 
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Example 1.6 (Random Borwein-Choi-Coons example). Let g:N—)•{ —l,+l}be the stochastic (i.e. 
random) multiplicative function defined by setting g(n) = X^{^) is coprime to 3, and g(3i) := Ej for 
j = 1,2,3,, where X 3 is as in Example 1.4 and 81 , 82 ,-" £ {“!)+!} are independently identically 
distributed signs, attaining —1 and +1 with equal probability. Arguing similarly to 1.5, we have 




1/2 


= I E £ 8m £ X3{md'] 

i>03'<n m<n/3' 

<( E 1)"' 

<C y/logn. 


1/2 


where to reach the second line we use the additivity of variance for independent random variables. Thus 
g in some sense has discrepancy growth like yfiog/V “on the average”. (Note that one can interpret this 
example as a special case of Example 1.5, by setting H to be the Hilbert space of real-valued square- 
integrable random variables.) However, by carefully choosing the base 3 expansion of n depending on the 
signs (similarly to Example 1.4) one can show that 


sup 



> 


k+l 

2 


and so the actual discrepancy grows like logA. So this random example actually has essentially the 
same discrepancy growth as Example 1.4. We do not know if scalar sequences of significantly slower 
discrepancy growth than this can be constructed. 

Example 1.7 (Numerical examples). In [12] a sequence f{n) supported on n < A := 1160, with values 
in { —1,+1} in that range, was constructed with discrepancy 2 (and a SAT solver was used to show that 
1160 was the largest possible value of N with this property). A similar sequence with N = 130000 of 
discrepancy 3 was also constructed in that paper, as well as a sequence with A = 127 645 of discrepancy 
3 that was the restriction to {1,... ,A} of a completely multiplicative sequence taking values in {—1, +1} 
(with the latter value of 127 645 being the best possible value of A; see also [1] for a separate computation 
confirming this threshold). This slow growth in discrepancy may be compared with the -^/logA type 
divergence in Example 1.5. 


The above examples suggest that completely multiplicative functions are an important test case for 
Theorem 1.1 and Corollary 1.2; the importance of this case was already isolated in [9]. More recently, the 
Polymaths project [19] obtained a number of equivalent formulations of the Erdos discrepancy problem 
and its variants, including the logical equivalence of Theorem 1.1 with the following assertion involving 
such functions. We define a stochastic element of a measurable space A to be a random variable g taking 
values in X, or equivalently a measurable map g: —)■ A from an ambient probability space (f2,/i) 
(known as the sample space) to A. 
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Theorem 1.8 (Equivalent fonn of vector-valued Erdos discrepancy problem). Let g: N —)• 5^ Zjc (3 
stochastic completely multiplicative function taking values in the unit circle := {z G C : |z| = 1} (where 
we give the space (5^)^ of functions from N to S' the product o-algebra). Then 


supE 

n 



= -j-oo. 


By converting all the probabilistic language to measure-theoretic language, Theorem 1.8 has the 
following equivalent form: 

Theorem 1.9 (Measure-theoretic formulation). Let ft) be a probability space, and let g : (S')'^ 

be a measurable function to the space (S')'^ of functions from N to S', such that g(co) G (S')'^ is 
completely multiplicative for p-aimost every (O G Q. (that is to say, g{(o){nm) = g{(o){n)g((o){m) for all 
G N and p-almost all (0 G LI). Then one has 


sup 

n 



dp((o) = -I- oo. 


The equivalence between Theorem 1.1 and Theorem 1.8 (or Theorem 1.9) was obtained in [19] using 
a Fourier-analytic argument; for the convenience of the reader, we reproduce this argument in Section 2. 
The close similarity between Example 1.5 and Example 1.6 can be interpreted as a special case of this 
equivalence. 

It thus remains to establish Theorem 1.8. To do this, we use a recent result of the author [21] regarding 
correlations of multiplicative functions: 

Theorem 1.10 (Logarithmically averaged nonasymptotic Elliott conjecture). [ 21, Theorem 1.3]Let ai,a 2 
be natural numbers, and let bi,b 2 be integers such that a\b 2 —a 2 bi 7^ 0. Lets >0, and suppose that A is 
sufficiently large depending on e,a\,a 2 -,b\,b 2 . Letx >w>A, and letg\,g 2 '. N —)■ C he multiplicative 
functions with |gi (n) |, |g 2 (n) | < 1 for all n, with “non-pretentious” in the sense that 


I 


l-Regi{p)x{p)p 

P 


>A 


( 1 . 1 ) 


for all Dirichlet characters % of period at most A, and all real numbers t with |t| < Ax. Then 


I 

x/w<n<x 


g\{a\n + b\)g2{a2n + b2) 
n 


< £ log ft). 


( 1 . 2 ) 


This theorem is a variant of the Elliott conjecture [8] (as corrected in [16]), which in turn is a 
generalisation of a well known conjecture of Chowla [5]. See [21] for further discussion of this result, 
the proof of which relies on a number of tools, including the recent results in [14], [16] on mean values 
of multiplicative functions in short intervals. It can be viewed as a sort of “inverse theorem” for pair 
correlations of multiplicative functions, asserting that such correlations can only be large when both of 
the multiplicative functions “pretend” to be like modulated Dirichlet characters n 1 —)■ x{n)n‘L 
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Using this result and a standard van der Corput argument, one ean show that the only potential 
counterexamples to Theorem 1.8 come from (stochastic) completely multiplicative functions that usually 
“pretend” to he like modulated Dirichlet characters (cf. Examples 1.3, 1.4, 1.6). More precisely, we have 


Proposition 1.11 (van der Corput argument). Suppose that g : N —)• S' a stochastic completely multi¬ 

plicative function, such that 


n 


2 


E 


X 8(^') 

,/=i 


<C^ 


(1.3) 


for some finite C > 0 and all natural numbers n (thus, g would be counterexample to Theorem 1.8). 
Let e > 0, and suppose that X is sufficiently large depending on E,C. Then with probability 1 — 0(e), 
one can find a (stochastic) Dirichlet character x of period q = Oc,e(l) <^nd a (stochastic) real number 
t = Oc.e ) such that 


I 

p<X 


l-Reg(p)x(p)p 

P 


ii 

— ^C,e 


1 . 


(1.4) 


(See Section 1.1 helow for our asymptotic notation conventions.) We give the (easy) derivation of 
Proposition 1.11 from Theorem 1.10 in Section 3. One can of course reformulate Proposition 1.11 in 
measure-theoretic language if desired, much as Theorem 1.8 may he reformulated as Theorem 1.9; we 
leave this to the interested reader. Of course, Theorem 1.8 implies that the hypotheses of Proposition 1.11 
cannot hold, and so Proposition 1.11 is in fact vacuously true; nevertheless it is necessary to establish this 
proposition independently of Theorem 1.8 to avoid circularity. 

It remains to demonstrate Theorem 1.8 for random completely multiplicative functions g that obey 
(1.4) with high probability for large X and small e. Such functions g can be viewed as (somewhat 
complicated) generalisations of the Borwein-Choi-Coons example (Example 1 .4), and it turns out that a 
more complicated version of the analysis in Example 1.4 (or Example 1.5) suffices to establish a lower 
bound for E| L/=i §(7)1^ (of logarithmic type, similar to that in Example 1.5) which is enough to conclude 
Theorem 1.8 and hence Theorem 1.1 and Corollary 1.2. We give this argument in Section 4. 

In principle, the arguments in [21] provide an effective value for A as a function of e,a\,a 2 ,b\,b 2 in 
Theorem 1.10, which would in turn give an explicit lower bound for the divergence of the discrepancy in 
Theorem 1.1 or Corollary 1.2. However, this bound is likely to be far too weak to match the ydogiV type 
divergence observed in Example 1.5. Nevertheless, it seems reasonable to conjecture that the -y/IogTV 
order of divergence is best possible for Theorem 1.1 (although it is unclear to the author whether such a 
slowly diverging example can also be attained for Corollary 1.2). 

The arguments in this paper can also be used to partially classify the multiplicative (but not completely 
multiplicative) functions taking values in { —1,+1} that have bounded partial sums; see Section 5. 


Remark 1.12. In [19, 10], Theorem 1.1 was also shown to be equivalent to the existence of sequences 
(cm.d)m,dm^ (^n)«GN of non-ncgativc reals such that Jfm,dCm,d = h = °°> ^nd such that the real 
quadratic form 

('in.d (ttd T ^2d T ‘ ' T bnX^ 

m^d fi 

is positive semi-definite. The arguments of this paper thus abstractly show that such sequences exist, but 
do not appear to give any explicit construction for such a sequence. 
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Remark 1.13. In [10, Conjecture 3.12], the following stronger version of Theorem 1.1 was proposed: 
if C > 0 and N is sufficiently large depending on C, then for any matrix {aij)i<ij<N of reals with 
diagonal entries equal to 1, there exist homogeneous arithmetic progressions P = {d,2d,. ..,nd} and 
Q = {d',2d',... ,n'd'} in {1,... ,N} such that 

I Y. L ^ c. 

iePjeQ 

Setting aij := {f {i), f (j)) h, we see that this would indeed imply Theorem 1.1 and thus Corollary 1.2. 
We do not know how to resolve this conjecture, although it appears that a two-dimensional variant of 
the Fourier-analytic arguments in Section 2 helow can handle the special case when aij = ±1 for all i,j 
(which would still imply Corollary 1.2 as a special case). We leave this modification of the argument to 
the interested reader. 

Remark 1.14. As a further near-counterexample to Corollary 1.2, we present here an example of a 
sequence / :N— )>{—l,-|-l}for which 


sup 

mGN 






OO 


for each d (though of course the left-hand side must he unbounded in d, thanks to Corollary 1.2). We set 
/(I) := 1, and then recursively for D = 1,2,... we define 

f{jD\ + k) :={-iyf{k) 

for k = 1,... ,D! and j = I,... ,D. Thus for insfance fhe firsf few elemenfs of fhe sequence^ are 


1 ,- 1 ,- 1 , 1 , 1 ,- 1 ,- 1 , 1 , 1 ,- 1 ,- 1 , 1 , 1 ,- 1 ,- 1 , 1 , 1 ,- 1 ,.... 

If d is anafural number and D is any odd number larger fhan cf, fhen we see fhaf fhe block/((f),/(2<i),... ,/((D-|- 
1)!) is D -f 1 alfemafing copies of f{d),f{2d),... ,f{D\) and fhus sums to zero; in facl if we divide 
f{d),f{2d),... into consecutive blocks of lengfh (D -f 1)!/<i fhen all such blocks sum to zero, and so 
sup„ I f{jd) \ is finite for all d. 

Remark 1.15. There is a curious superficial similarify befween fhe argumenfs in fhis paper and fhe 
Hardy-Lifflewood circle mefhod. In fhe latter, Fourier analytic argumenfs are used to reduce matters 
to estimates on “major arcs” and “minor arcs”; in fhis paper, Fourier analytic argumenfs are used to 
reduce maffers to esfimafes for “prefenfious mulfiplicafive funcfions” and “non-prefenfious multiplicative 
funcfions”. We do nol know if fhere is any deeper significance to fhis similarify. 


1.1 Notation 

We adopf fhe usual asymptotic nofafion of A ^ F, T » X, or A = 0(F) to denote fhe assertion fhaf 
|A| < CF for some consfanf C. If we need C to depend on an additional parameter we will denote fhis 

2oEIS A262725 
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by subscripts, e.g. X = Oe{Y) denotes the bound |X| < CgT for some Cg depending on e. For any real 
number a, we write e{a) := 

All sums and products will be over the natural numbers N = {1,2,...} unless otherwise specified, 
with the exception of sums and products over p which is always understood to be prime. 

We use d\n to denote the assertion that d divides n, and n (d) to denote the residue class of n modulo 
d. We use {a,b) to denote the greatest common divisor of a and b. 

We will frequently use probabilistic notation such as the expectation EX of a random variable X 
or a probability E(£') of an event E. We will use boldface symbols such as g to refer to random (i.e. 
stochastic) variables, to distinguish them from deterministic variables, which will be in non-boldface. 


2 Fourier analytic reduction 

In this section we establish the logical equivalence between Theorem 1.1 and Theorem 1.8 (or Theorem 
1.9). The arguments here are taken from a website^ of the Polymaths project [19]. 

The deduction of Theorem 1.9 from Theorem 1.1 is straightforward: if (fl, /r) and g are as in Theorem 
1.9, one takes H to he the complex Hilhert space L^(fl,/r), and for each natural number n, we let f{n) G H 
be the function 

f{n) : (0^g{(0){n). 

Since g{(o){n) G f{n) is clearly a unit vector in H. For any homogeneous arithmetic progression 
{d,2d, ...,nd}, one has 


L fUd) 


H 



2 

dii{(o) 


diJL{co) 


and on taking suprema in n and d we conclude that Theorem 1.9 follows from Theorem 1.1. (Note that 
this argument also explains the similarity between Example 1.6 and Example 1.5.) 

Since Theorem 1.9 is equivalent to Theorem 1.8, it remains to show that Theorem 1.8 implies Theorem 
1.1. We take contrapositives, thus we assume that Theorem 1.1 fails, and seek to conclude that Theorem 
1.8 also fails. By hypothesis, we can find a function / : N —)> // taking values in the unit sphere of a 
Hilbert space H and a finite quantity C such that 


L fiid) 


i=i 


H 


<c 


( 2 . 1 ) 


for all homogeneous arithmetic progressions d,2d,... ,nd. By complexifying H if necessary, we may 
take // to be a complex Hilbert space. To obtain the required conclusion, it will suffice to construct a 

^michaelnielsen.org/polymathl/index.php?title=Fourier_reduction 
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random completely multiplicative function g taking values in 5^ such that 


E 



<Ccl 


for all n. 

We claim that it suffices to construct, for each X >1, a. stochastic completely multiplicative function 
gx taking values in such that 

2 


n 


E 


LgxO') 

j=i 


<Cc 1 


( 2 . 2 ) 


for all n <X, where the implied constant is uniform in n and X, but we allow the underlying probability 
space defining the stochastic function gx to depend on X. This reduction is obtained by a standard 
compactness argument^, but we give the details for sake of completeness. Suppose that for each X, 
we have such a gx obeying (2.2) as above. Let M be the space of completely multiplicative functions 
g : N —)• S* from N to one can view this space as isomorphic to an infinite product of S'’s, since 
completely multiplicative functions are determined by their values at the primes. In particular, M is 
a compact metrisable space; it can be viewed as a compact subspace of the space (S^)^ of arbitrary 
functions (not necessarily multiplicative) from N to . 

Just as Theorem 1.8 is equivalent to Theorem 1.9, we can view each gx as a measurable map 
fx : f2x —)• M, such that 

2 


/ Q.X 




;=i 


dl-ixi(o) <Cc 1 


for all n < X. We can then define a Radon probability measure Vx on M to be the probability distribution 
(or law) of the random variable gx, or equivalently the pushforward of the measure /Tx via fx- That is to 
say. 


[ F{g) dvxig) =EF{gx) 

JM 

= [ F{fx{(o)) dnx{(0) 

J six 

for any continuous function F : M —)• C. The functions g i—)> \ Y!j=i sij)\^ continuous on M, and hence 



'Cc 1 


^If one wished to obtain a more quantitative version of Theorem 1.1, one would avoid this compactness argument and work 
instead with truncated versions of Theorem 1.8 (or Theorem 1.9) in which one restricts the n parameter to be less than some 
large cutoff. This would then require similar truncations to be made in the arguments in later sections, which in particular 
requires some treatment of error terms created when truncating Euler products, but such errors can be made negligible by 
making the truncation parameter extremely large with respect to all other parameters. We leave the details of this reformulation 
of the argument to the interested reader. 
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for all n <X. By vague compactness of probability measures on compact metrisable spaces such as M 
(Prokhorov’s theorem), we can thus extract a subsequence VXj of the Vx with Xj —)■ oo such that the VXj 
converge to a Radon probability measure V on M, that is to say 


/ F{g) dvxjig)^ f F{g)dvx{g) 

Jm Ju 

as 7 —)■ oo for all continuous functions F : M —)> C. Applying this in particular to the continuous functions 
g^\Vj=igOt we conclude that 



1 


for all n. We then define the random completely multiplicative function g : N —)■ (or equivalently, a 
measurable map from a probability space to M) by choosing (M, v) as the underlying probability space, 
and using the identity function g i—)• g as the measurable map. We then have 


E 



1 


for all n, and the claim follows. 

It remains to construct the random multiplicative functions gx for each X. Let X > 1, and let p i,..., 
be the primes up to X. Let M > A be a natural number that we assume to be sufficiently large depending 
on C,X. Define a function F: —)■ // by the formula 

F{a\ {M),...,ar (M)) :=/(p"'.. .p^'') 

for ai,...,arG{0,...,M — 1}, thus F takes values in the unit sphere of //. We also define the function 
71: [1,A] —)• (Z/MZ)'' by setting 7r(pj' ...p)!'') := {ai,...,ar) whenever p^' ...p"'' is in the discrete 
interval 

[l,X] :={neN: 1 <n<A}; 

note that n is well defined for M > X. Applying (2.1) for n < X and d of the form Pj'.. .p°/ with 
1 <ai<M — X,v/e. conclude that 


Y,F{x + 7i{j)) 


1 


for all 71 < A and all but Ox (M'' ') of the AF elements x= ,x,.) of {’L/M'LY. For the exceptional 

elements, we have the trivial bound 


n 


Y,F{x + K{j)) 
,/=i 


< 71 < A 

H 
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from the triangle inequality. Square-summing in x, we conclude (if M is sufficiently large depending on 
C,X) that 


1 

W 


x&{'L/M'Ly 


Y,F{x + 7i{j)) 

;=i 


"Cc 1- 


(2.3) 


H 


By Fourier expansion, we can write 


F{x)= X 


where (xi,... ,Xr) ■ ■= xi^i y - hXr^r, and the Fourier transform F: (Z/MZ)'' H is 

defined by the formula 


M 


A routine Fourier-analytic calculation (using the Plancherel identity) then allows us to write the left-hand 
side of (2.3) as 


^6 


E I|f(5)il 


H 


f=l 


M 


On the other hand, from a further application of the Plancherel identity we have 

E IIF5)II?, = i 


5€ 


and so we can interpret ||F(i^) |||/ as the probability distribution of a random frequency = (i^ ..., i^^) G 
(Z/MZ)'' (using (Z/MZ)'' as the underlying sample space). The estimate (2.3) now takes the form 


E 



(^) 


<Cc 1 


for all n < X. If we then define fhe stochastic completely multiplicative function gx by setting gx{Pj) •= 
e{^j/M) for j = l,...,r, andgx(p) := 1 for all other primes, we obtain 


E 


n 


LgxO') 

;=i 


<Cc 1 


for all n < X, as desired. 

Remark 2.1. It is instructive to see how the above argument breaks down when one tries to use the 
Dirichlet character example in Example 1.3. While % often has magnitude 1 in the ordinary (Archimedean) 
sense, the function (ai,... ,a^) i—x(Pi' • • ■P‘^/) is almost always zero, since the argument of % 

is likely to be a multiple of q. As such, the quantity ||F’(i^) ||^ sums to something much less than 1, and 
one does not generate a stochastic completely multiplicative function g with bounded discrepancy. 
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Remark 2.2. The above arguments also show that Theorem 1.1 automatically implies an apparently 
stronger version^ of itself, in which one assumes \\f{n) ||// > 1 for all n, rather than ||/(n) ||// = 1- Indeed, 
if / has bounded discrepancy then it must be bounded (since f{n) is the difference of Lj=i fU) ^^d 
fU))^ the above arguments then carry through; the sum L(^g(z/mz)' \\P{^ ) 11^ iiow greater than 
or equal to 1, but one can still define a suitable probability distribution from the ||T’((§) |||^ by normalising. 


Remark 2.3. If Theorem 1.8 failed, then we could find a constant C > 0 and a stochastic completely 
multiplicative function g: N —)■ 5' such that 


E 



2 

<C^ 


for all n. In particular, by the triangle inequality we have 




2 

<C^ 


and hence for each N, there exists a deterministic completely multiplicative function N —)■ 5^ such 
that 


1 

N 


N 


n—\ 


;=i 


<c2. 


Thus, to prove Theorem 1.8 (and hence Theorem 1.1 and Corollary 1.2), it would suffice fo obfain a lower 
bound of the form 


1 

N 


N 


I 

n=\ 



> (o{N) 


(2.4) 


for all deterministic completely multiplicative functions g: N —)■ all > 1, and some function co{N) 
of N that goes to infinity a&N ^ This was in fact the preferred form of the Fourier-analytic reduction 
obtained by the Polymaths project [19], [10]. It is conceivable that some refinement of the analysis in 
this paper in fact yields a bound of the form (2.4), though this seems to require removing the logarithmic 
averaging from Theorem 1.10, as well as avoiding the use of Lemma 4.1 below. 


3 Applying the Elliott-type conjecture 


In this section we prove Proposition 1.11. Let g,C,e be as in that proposition. Let // > 1 be a moderately 
large natural number depending on e to be chosen later, and suppose thatX is sufficiently large depending 
on H, e. From (1.3) and the triangle inequality we have 


E y - 

VX<n<X ” 


n 


Y 8(^') 


2 

<CclogX; 


^This version was suggested by Harrison Brown in the Polymaths project. 
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a similar argument (for X large enough) gives 


E y - 

VX<n<X ” 


n+H 






and hence by the triangle inequality 


E y - 

VX<n<X ” 


n+H 


£ g(i) 

j=n+l 


<Cc logX. 


Thus from Markov’s inequality we see with probability 1 — 0(e) that 


L 

y/X<n<X 


1 

n 


n+H 

£ g(i) 


j=n+l 


2 

<C,e logX, 


which we rewrite as 


£ 

y/X<n<X 


1 

n 


H 


£g(n+/j) 

/i=i 


2 


<C,e logi^f. 


We can expand out the left-hand side of (3.1) as 


(3.1) 


^ ^ g{n + hi)g(n + h2) 

hi,h2e[i,H]VX<n<X ” 

The diagonal term hi,h 2 contributes a term of size ^ HlogX to this expression. Thus, choosing H to 
be a sufficiently large quantity depending on C, e, we can apply the triangle inequality and pigeonhole 
principle to find distinct (and stochastic) hi,h 2 G [1,^] such that 


£ 

s/X<n<X 


g(n + hi)g(n + h2) 
n 


^C,e,H logX. 


Applying Theorem 1.10 in the contrapositive, we obtain the claim. (It is easy to check that the quantities 
X,t produced by Theorem 1.10 can be selected to be measurable, for instance one can use continuity to 
restrict t to be rational and then take a minimal choice of t) with respect to some explicit well-ordering 
of the countable set of possible pairs {xd)-) 

Remark 3.1. The same argument shows that the hypothesis |g(n)| = 1 may be relaxed to |g(?i)| < 1, and 
g need only be multiplicative rather than completely multiplicative, provided that one has a lower bound 
of the form ^ log^. Thus the Dirichlet character example in Example 1.3 is in some 

sense the “only” example of a bounded multiplicative function with bounded discrepancy that is large 
for many values of n, in that any other such example must “pretend” to be like a (modulated) Dirichlet 
character. (We thank Gil Kalai for suggesting this remark.) 
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4 A generalised Borwein-Choi-Coons analysis 


We can now complete the proof of Theorem 1.8 (and thus Theorem 1.1 and Corollary 1.2). Our arguments 
here will be based on those from a website^ of the Polymaths project [19], which treated the case in 
which the functions g and % appearing in Proposition 1.11 were real-valued (and the quantity t was set to 
zero). 

Suppose for contradiction that Theorem 1.8 failed^, then we can find a constant C > 0 and a stochastic 
completely multiplicative function g: N —>• S* such that 


E 


n 


L 8 O') 

7=1 


2 


for all natural numbers n. We now allow all implied constants to depend on C, thus 


E 



< 1 


for all n. The stochastic nature of g is a mild technical nuisance for our arguments, but the reader may 
wish to assume g as a deterministic completely multiplicative function for a hrst reading, as this case 
already captures the key aspects of the argument. 

We will need the following large and small parameters, selected in the following order: 

• A quantity 0 < e < 1/2 that is sufficiently small depending on C. 

• A natural number H >\ that is sufficiently large depending on C, e. 

• A quantity 0 < 5 < 1/2 that is sufficiently small depending on C,e,H. 

• A natural number k>l that is sufficiently large depending on C,£,H. 

• A real number X > 1 that is sufficiently large depending onC,£,H,5,k. 


We will implicitly assume these size relationships in the sequel to simplify the computations, for instance 
by absorbing a smaller error term into a larger if the latter dominates the former under the above 
assumptions. The reader may wish to keep the hierarchy 

1 1 

C<C - 

£ 0 

in mind in the arguments that follow. One could reduce the number of parameters in the argument by 
setting 5 := \/k, but this does not lead to significant simplifications in the arguments below. 

®michaelnielsen. org/polymathl/index .php?t it le= Bounded_discrepancy_multiplicative_functions_do, 
^Readers who are more comfortable with measure-theoretic notation than probabilistic notation may prefer to write the 
argument below starting from the failure of Theorem 1.9 rather than Theorem 1.8, replacing expectations with integrals, etc. 
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By Proposition 1.11, we see with probability 1 — 0(c) that there exists a Dirichlet character x of 
period q = Oe(l) and a real number t = Oe(X) such that 


I 

p<X 


1-Reg(p)z(p)p 

P 


1- 


(4.1) 


By reducing % if necessary we may assume that % is primitive. 
It will be convenient to cut down the size of t. 


Lemma 4.1. With probability 1 — 0(e), one has 

t = Oe{X^). (4.2) 

Proof. By Proposition 1.11 with X replaced by X^, we see that with probability 1 — 0(e), one can find a 
Dirichlet character x' of period q' = Oe(l) and a real number t' = Oe(X^) such that 


I 

p<X^ 


\-R&g{p)x'{p)p 

P 


<6 1 . 


We may restrict to the event that |t' —1| >X^, since we are done otherwise. Applying the pretentious 
triangle inequality (see [11, Lemma 3.1]), we conclude that 


I 

p<X^ 


\-R&X{p)x'{p)P 

P 


1. 


(4.3) 


The character XX' period Oe(l). Applying the Vinogradov-Korobov zero-free region for L{-,XX') 
(see [17, §9.5]), we see thatL{o + it,xx') / 0 for |t| > 10 and 

_((e_ 

(log|t|)2/3(loglog|t|)l/3 


for some Cg > 0 depending only on e; furthermore, an inspection of the Vinogradov-Korobov arguments 
(based on estimation of the logarithmic derivative of L{-,XX') the zero-free region) in fact yields the 
crude bound* 


\ogL{o + it,XX')\ <log^f''|t 


in this region (using a suitable branch of the logarithm), possibly after shrinking Cg if necessary. Using the 
contour-shifting arguments in [15, Lemma 2] and the bounds X^ < |t' —1| <Cg X, it is then not difficult to 
show that 


I 

exp ((logX)^/^) < p<X^ 


i-'Pqx{p)x'{p)p 

p 


> loglogA 


if X is sufficiently large depending on £, 5, a contradiction (note that the summands in (4.3) are nonnega¬ 
tive). The claim follows. □ 


*One can certainly improve the right-hand side here with a more careful argument; cf. [18, (11.6)]. But for the current 
application, a logarithmic bound will suffice. 
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Let us now condition to the probability 1 — 0{e) event that x, t exist obeying (4.1) and the bound 
(4.2); we can of course do this as e is assumed to be small. 

The bound (4.1) asserts that g “pretends” to be like the completely multiplicative function n i—)■ x{n)n‘^. 
We can formalise this by making the factorisation 

g{n) := xin)n’^h{n) (4.4) 


where x is the completely multiplicative function of magnitude 1 defined by setting x{p) •= X{p) for 
p\ q and ^(p) := g{p)p^‘*' for p\q, and h is the completely multiplicative function of magnitude 1 defined 
by setting h(p) := g{p)x{p)p ‘^ for p\ q, and h{p) = 1 for p\q. The function x should be compared 
with the function Xs in Example 1.4 and the function g in Example 1.6. 

With the above notation, the bound (4.1) simplifies to 


y, 1—Reh(p) 

kx P 


<Ce 1. 


(4.5) 


The model case to consider here is when t = 0 and h = 1, in which case g = X- In this case, one 
could skip directly ahead to (4.8) below. Of course, in general t will be non-zero (albeit not too large) 
and h will not be identically 1 (but “pretends” to be 1 in the sense of (4.5)). We will now perform some 
manipulations to remove the n'* and h factors from g and isolate medium-length sums (4.8) of the X 
factor, which are more tractable to compute with than the corresponding sums of g; then we will perform 
more computations to arrive at an expression (4.12) just involving x which we will be able to control 
fairly easily. 

We turn to the details. The first step is to eliminate the role of n'k Erom (1.3) and the triangle 
inequality we have 


E- 

H 




H' 

^ g{n + m) 

m—\ 


2 

< 1 


for all n (even after conditioning to the 1 — 0{e) event mentioned above). The pY,H<H'< 2 H averaging 
will not be used until much later in the argument, and the reader may wish to ignore it for the time being. 
By (4.4), the above estimate can be written as 


1 ^ 

E — ^ ^ ^(n-|-m)(n-|-m)'*h(?i-|-m) 

^ H<H'<2H m=\ 


<€. 1 . 


For n > we can use (4.2) and Taylor expansion to conclude that (n + mY^ = n'^ + H ^). The 
contribution of the error term is negligible, thus 


E- 

H 


H<H'<2H 


H' 

X{n + m)n‘*'h{n + m) 

m—\ 


2 

<1 


for all n > . We can factor out the n** factor to obtain 


it . 


E- 

H 


H<H'<2H 


H' 

X{n + m)h{n + m) 

m—\ 


2 

< 1 . 
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For n < we can crudely bound the left-hand side by H^. If 5 is sufficiently small, we can then sum 
weighted by conclude that 


E 


1 

H 


I I 

H<H'<2H n 


Lm=iX{n + m)h{n + m) 

„i+i/iogX ■ 


< logX. 


(The zeta function type weight of will be convenient later in the argument when one has to 

perform some multiplicative number theory, as the relevant sums can be computed quite directly and 
easily using Euler products.) Thus, with probability 1 — 0{e), one has from Markov’s inequality that 


s E E 

H<H'<2H n 


LnLiX{n + m)h{n + m) 
„l + l/logX 




We condition to this event, which we may do as e is assumed to be small. From this point onwards, our 
arguments will be purely deterministic in nature (in particular, one can ignore the boldface fonts in the 
arguments below if one wishes). 

We have successfully eliminated the role of we now work to eliminate h. To do this we will have 
to partially decouple the X and h factors in the above expression, which can be done^ by exploiting the 
almost periodicity properties of x as follows. Call a residue class a (q*^) bad if a -|- m is divisible by 
for some p|q and 1 < m < 2H, and good otherwise. We restrict n to good residue classes, thus 


- L L L 

^ H<H'<2Hae[l,q'‘], gOOdn=a (q*) 


Lm=iX{n + rn)h{n + m) 
„l + l/logX 


<e logX. 


By Cauchy-Schwarz, we conclude that 


- L L 

^ q*] good 


I 

n—a (q^) 


Lm=\X{n + m)h{n + m) 

„l + l/logX 


2 




log"^ 

q^ 


Now we claim that for n in a given good residue class a (q*^), the quantity xi^d- m) does not depend on n. 
Indeed, by hypothesis, {n + tn,c^) = (a-|-m,q^) is not divisible by for any p|qand is thus a factor of 


®The argument here was loosely inspired by the Maier matrix method [13]. 
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and 


n+m _ n+m 




is coprime to q. We then factor 

X{n + m) =x{{n + m,q^))x 
= X{{a + m,q^))X 
= X{{a + m,q^))X 


n + m 
{n + m,q^) 
n + m 
{a + m,q'^) 
a + m 
{a + m, q*') 

where in the last line we use the periodicity of X- Thus we have x{n + m) = x{a + m), and so 


1 

H 


I I 

H<H'<2Hae[\,ql<], gOOd 


T—’ ~ / \ v~’ + m^ 

Y^X{a + m) £ 


m—\ 


n—a (q^ 


,1 + l/logX 


<c 


log"^ 

q^ 


Shifting n by m, we see that 


h{n + m) 


y-i h(n) 

l+l/logX- L l+l/logA+'^^(l) 

n—a (q'^j n—a-\-m (q^j 


and thus (for X large enough) 


1 

H 


I I 

H<H’<2Haell,ql‘], gOOd 
log^X 


H' 


Y^X{a + m) £ 

m—\ n—a-^m (q^' 


h(») 

„l + l/logA 


(4.6) 


<C 


q^ 


Now, we perform some multiplicative number theory to understand the innermost sum in (4.6), with 
the aim of showing that the summand here is approximately equidistributed modulo q^. From taking 
Euler products, we have 

V- h(n) 


,1 + l/logX 


= e 


where (5 is the Euler product 


e := 


r\( 1_ 

11 pW/logX J 


Erom (4.5) and Mertens’ theorem one can easily verify that 

logX <Ce I ©I <Ce logX. 

More generally, for any Dirichlet character Xi we have 


(4.7) 


Xi{ri)h{n) 


i-i „l + l/logX 

« “ p 


n 1 


Kp)Xi{p) 

^1 + 1/logZ 
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If X\ is a non-principal character of period dividing q^, then the L-function L{s,x) '■= L« is analytic 
near 5 = 1, and in particular we have 

We conclude that 


y ZijnMn) _ / h(p)^yY 

-L^i+i/iogz + V y ^ 


<Cq,^ exp Y, 


|l-h(p)| \ 

pl + l/logX j 
1 -h(p)|^ 


<q.^ exp £ 

\p<X P 


^q,k exp £ 

\p<x 


0(l-Reh(p))i/2\ 

~P J 


Zijp) \ 

pl + l/logX J 


<.q,k exp ^(loglogX) £ -— 
<Cq,^ exp(Oe((loglogX)^/^)) 


where we have used the Cauchy-Schwarz inequality, Mertens’ theorem, and (4.5). For a principal 
character Xo of period r dividing we have 


y Zo^jf^) _ TT ^1 _ \ 

Z- 1 + 1/logZ “111 „l + l/logX I 

n “ pfr \ / 





e + Oe(l) 


thanks to (4.7) and the fact that h(p) = 1 for all p|r, and that all prime factors of r divide q and are thus 
of size Oe(l). By expansion into Dirichlet characters we conclude that 


I 

n—b (r) 


h(») 

„l + l/logX 


y + Oq.t: (exp (Og ((log logX) ^ /^))) 


for all r\q^ and primitive residue classes b (r). For non-primitive residue classes b (r), we write r = {b,r)r' 
and b = {b,r)b'. The previous arguments then give 


I 

n—b' (r') 


Hn) 

„l + l/logX 


^ + OqT (exp (Og ((log log X) ^ /^))) 
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which since h((Z7,r)) = 1 gives (again using (4.7)) 


I 

n—b (r) 


h(») 

„l + l/logX 


— + Oq.-t (exp (Oe ((log logX) ^ /^))) 
r 


for all b (r) (not necessarily primitive). Inserting this back into (4.6) we see that 


- L L 

^ H<H'<2H a6[l V] gOOd 



+ Oq,,t(exp(Oe((loglogX)i/2)))^ 




lo^ 


The contribution of the Oq ,t(exp(Oe((loglogX)^/^))) error term here can be shown by (4.7) to be at most 
Cg log^X/in magnitude if X is large enough, for any Cg > 0 depending only on e. Removing this error 
term and then applying (4.7) again to cancel off the 6 term, we conclude that 


1 

q^ 


I 

a6[l,q*] good 


1 

H 


I 

H<H'<2H 


H' 


^ X{a + m) 

m—\ 


1- 


(4.8) 


We have now eliminated both t and h. The remaining task is to establish some lower bound on the 
discrepancy of medium-length sums of x that will contradict (4.8). As mentioned above, this will be a 
more complicated variant of the analysis in Examples 1.4, 1.5, 1.6 in which the perfect orthogonality in 
Example 1.5 is replaced by an almost orthogonality argument. 

We turn to the details. We first dispose of the easy case^*’ when q = 1. In that case x is identically 
one, and the left-hand side simplifies to ^ which is comparable to and leads to a 

contradiction since H is large. Thus we may restrict to the event that q > 1, so that the primitive character 
X is non-principal. 

Next, we expand (4.8) to obtain 

4 L L L Xia + mi)x{a + m2) 

H<H'<2Hmi,m2&[\,H'] ae[l,q*], good 

Write d\ := {a + m\,c^) and d 2 '■= (a-|-/n 2 ,q^), thus <ii,(i 2 |q^ ^ and for i = 1,2 we have 

X{a + mi) = x{di)x 

We thus have 

L Udi)I{d2)^ £ £ 

X 

as[l,q*], gOOd:(a+mi,q*)=di,(a+m 2 ,q*)=ii 2 

*®In this case, many of the previous manipulations become degenerate, and one could have disposed of this case by a 
simplified version of the above arguments. 


a + mi\_/'a + m2\ j, 
di )^( d2 ) • 


a + mi\ 
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We reinstate the bad a. The number of such a is at most 

y < Hi-w 

% in/lf 

so their total contribution here is which is negligible, thus we may drop the requirement in 

(4.9) that a is good. 

Note that as % is already restricted to numbers coprime to q, and d\,d 2 divide q*^^^ we may replace 
the constraints (a + nr,-,q^) = di with + for i = 1,2. Summarising these modifications, we have 
arrived at the estimate 


E Udi)xid2)^ E E 


E ^ 

a&[l,q^]\di\a-]-m\;d2\a^m2 


a + m\ \ _ 

—F 


a + m2 
d2 


q^ 


(4.10) 


Consider the contribution to the left-hand side of (4.10) of an off-diagonal term di / da for a fixed 
choice of mi,m 2 . To handle these terms we use the Fourier transform to expand the character x{n) 
(which, as mentioned before Lemma 4.1, can be taken to be primitive) as a linear combination of 
e{^nlq) for ^ G (Z/qZ)^. Thus, the function n i—)> ldi\nX can be written as a linear combination 
of n I—)■ lrfj|„c(i§n/diq) for G Z coprime to q, which by Fourier expansion of the \d^\n factor (and 
the fact that all the prime factors of di also divide q) can in turn be written as a linear combination of 
n I—)• e{^n/diq) for {^,d\q) = 1. Translating, we see that the function 


^ ^ f lil \a+mi X 


a + mi\ 
di ) 


can be written as a linear combination of a i—)• c(i§a/diq) for (i^ ,diq) = 1. Similarly 


^ ^ ^d2\a+m2X 


a + m2\ 

d2 J 


can be written as a linear combination of a i—)• e{^a/d 2 q) for {^,d 2 q) = 1. If di F d 2 , then the fre¬ 
quencies involved here are distinct; since q^ is a multiple of both diq and d 2 q, we conclude the perfect 
cancellation^^ 


E ^ 

\a--\-mi;d2\aF-fn2 


—F 


a + m2 
d2 


= 0 . 


(4.11) 


Thus we only need to consider the diagonal contribution di = di to (4.10). For these diagonal terms we 
do not perform a Fourier expansion of the character X- The ^(di)x(d 2 ) terms helpfully cancel, and we 


" We thank Andrew Granville for observing this perfect cancellation, which allowed for some simplifications to this part of 
the argument. 
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obtain the bound 


L I. E 

d\q^ ' H<H'<2Hm[,m2€[l,H']ae[l,q^]'.d\a+nn,a+m2 


X 


a + m\ 
d 


_f a + m2\ 
) 


q^ 


(4.12) 


We have now eliminated %, leaving only the Dirichlet character % which is much easier to work with. We 
gather terms and write the left-hand side as 

E s E E 

^ H<H'<2Hae[l.,q^] 


I ^ 

m^[\^H'\.d\a-\-m 


a-\-m 


The summand in d is now non-negative. We can thus discard all the d that are not of the form d = q‘ with 
q' < \/H, to conclude that 


H 


I I 


i-.q‘<VH H<H'<2H aell.q'^] 


L X 


a + m 


<.e 


It is now that we finally take advantage of the averaging jj Y.h<h'<2H 1° simplify the m summation. 
Observe from the triangle inequality that for any H' G /I] and a G [l,q^] one has 


I ^ 

H' <m<H'+q'\q'\a+m 


a + m 

q' 





2 

<c 

I 

^ a + m\ 

+ 



mQ.[\,H'\\q'\a-\-m 

K H / 



I ^ 

nJ6 [1 ,H'+q'] :q' |a+m 


a + m 

q' 


summing over i,H’,awe conclude that 
^ 1 


-H 


I I 


I ^ 

H' <m<H'~^q':q'\a-\-m 


a-\-m 


<Ce q^ 


H'e[H,3Hl2]ae[l,a^] 

In particular, by the pigeonhole principle there exists H' G [H,3H/2] such that 

2 


I I 

i:q'<\/ff ci£[l,q*] 


I ^ 

H'<m<H'+q':q'|a+m 


a + m 


<Ce q^ 


Shifting a by H' and discarding some terms, we conclude that 


I I 

/:q'<^ae[l,q*/2] 


I ^ 

0<m<q':q'|£i+m 


a + m 

q' 


q 
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Observe that for a fixed a there is exactly one m in the inner sum, and = 


+ 1. Thus we have 


I I 

i:q‘<VHae[l,q'‘/2] 


X 


+ 1 


q'‘ 


Making the change of variables b := 


+ 1 and discarding some terms, we thus have 


E q' E «. q‘. 

i:q‘<VH be[l,q^-‘/A] 

But b I—;■ is periodic of period q with mean 1, thus 

I lzWP»rq'-' 

fe6[l,q*-'74] 

which when combined with the preceding bound yields 


I l«rl, 

i:q‘<VH 


which leads to a contradiction for H large enough (note the logarithmic growth in H here, which is 
consistent with the growth rates in Example 1.5). The claim follows. 


5 Sums of multiplicative functions 


One corollary of Corollary 1.2 is that if/:N—)•{—l,+l}isa completely multiplicative function, then 


sup| X^/(j)| = +°°- 

« j=i 


One can ask (as was done in [9]) whether the same claim holds if / is only assumed to be multiplicative 
rather than completely multiplicative. As noted in [6], there is a simple counterexample, namely the 
multiplicative function >•{ —1,+1} defined by setting X 2 {n) ■= + l when n is odd and X 2 {n) := — 1 

when n is even. A bit more generally, any function of the form / = X 2 h is a counterexample, where 
h :N ^ {—1,+1} is a multiplicative function such that h{2^) = 1 for all j, and h{pj) = 1 for all but 
finitely many prime powers p^, since such functions are periodic with mean zero thanks to the Chinese 
remainder theorem. In the converse direction, the methods of this paper can be used to show 


Theorem 5.1. Let f : N —)> { — 1, + 1} be a multiplicative function such that sup„ | f{j)\ < +°°- Then 

/(2^) = — \ for all j, and 


^ f{p) 
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Informally, this theorem asserts that the only multiplicative functions / :N— —1,+1} with bounded 
sums are ones which “pretend” to be ;^ 2 - In [6], it was shown that if/:N—)■{ —l,+l}isa multiplicative 
function with /(2^) = 1 for some natural number j, and one had the asymptotic 'LpKxfip) = {C + 
o(l))log;c for some 0 < c < 1, then sup„ \ f {i)\ = +°°- This is implied by the above theorem. It 
seems likely that one can carry the analysis further and conclude that the periodic examples given above 
are the complete list of multiplicative / : N ^ { -1 , + 1} with sup„ I 1 /(;■) I < but we will not do 

so here. 

We sketch the proof of the above theorem as follows. Suppose that we have a multiplicative function 
/:N— ;■{ —1, + 1} such that | Ly=i f{i) I < C for all n and some finite C. Henceforth we allow implied 
constants to depend on C. Applying Proposition 1.11 (ignoring the stochasticity and the e parameter, 
and generalising from completely multiplicative functions to multiplicative functions as in Remark 3.1), 
we see that for each X > 1, one can hnd a Dirichlet character % of period q = 0{\) and a real number 
t = 0{X) such that 

y l-Re/(p)f(p)p-'^ 

kx P 

Actually, since / is real-valued, we may take t = 0 and % to be real-valued by the triangle inequality 
argument in [16, Appendix C]. Thus 


I 


i-/(p)x(p) 

P 


< 1 . 


Currently, % is allowed to depend on x, but the number of possible % is bounded independently of x, 
and so by the pigeonhole principle (or compactness) we can thus find a Dirichlet character % of period 
q = 0{l) such that 

Y 1 -f{p)x{p) j 

P P 

As before, we may assume without loss of generality that % is primitive. We then factor 


f = Xh 


where x is the multiplicative function such that ^(n) := x{n) for {n,q) = 1 and x{p^) •= f{p^) for p\q 
and j > 1, and h = f /% is also multiplicative taking values in {—1, -|-1}, with h{p^) = 1 whenever p\q 
and 7 > 1, and 


I 


1 -^(p) 
P 


< 1 . 


(5.1) 


We allow implied constants to depend on q,Xih. Suppose first that h{2^) = -fl for at least one natural 
number j. Then the Euler factors L7=o non-zero for every prime p (the only dangerous case 

being p = 2), and one can then check from (5.1) and Mertens’ theorem that the singular series 


s-ni- 

p j=oP 


Kp^) 

;(I + l/logX) 
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obeys the bounds 

logX <C S <C logX 

for all sufficiently large X (recall we allow implied constants to depend on h). One can then check that 
the argument in Section 4 (ignoring the stochasticity, the e parameter, and the complex conjugations, 
and setting t to zero) continues to work with very little modification (even though x and h are now 
only multiplicative rather than completely multiplicative) to give the desired contradiction for suitable 
choices of parameters H,k,X as before (the 5 parameter is irrelevant, since Lemma 4.1 is automatic in 
this setting). Thus we may assume that h{2^) = —1 for all j, which implies in particular that q is odd 
since h{p) = +1 for all p\q. 

The Euler product S now vanishes due to the p = 2 factor, which prevents us from applying the 
arguments in Section 4 immediately. To get around this, we now factor 

f = i'h' 


where % •= XiX and h' := Xih- One can check that the singular series 


obeys the bounds 


©'•=rry’.. _ 


logX <C 6' logX 


for sufficiently large X. If ^ / 1, one can then run the previous arguments with %, h replaced by , h' 
respectively (and replaced by 2 q^), arriving at the analogue 


1 

V 


1 


H' 


L X'{a + m) 

m—i 


< 1 


ae[l,2^*] good H<H'<2H 
of (4.8). But we can write (a+ m) =—;^ 2 («)X 2 (wj)x('^+ '”)> and hence 

1 ^ 


1 

V 


a6[l,2^*] gOoA^ 


H' 


L X2im)xia + m) 

m—l 


<1 


(5.2) 


and by repeating the rest of the arguments in Section 4 (carrying along the X 2 {>n) and 2 factors which end 
up being harmless) we again obtain a contradiction. Thus <7=1, and the theorem follows. 
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